A lower bound on the dimension of a quantum system given measured data 
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We imagine an experiment on an unknown quantum mechanical system in which the system is 
prepared in various ways and a range of measurements are performed. For each measurement M 
and preparation p the experimenter can determine, given enough time, the probability of a given 
outcome a: p(a\M,p). How large does the Hilbert space of the quantum system have to be in 
order to allow us to find density matrices and measurement operators that will reproduce the given 
probability distribution? In this note, we prove a simple lower bound for the dimension of the 
Hilbert space. The main insight is to relate this problem to the construction of quantum random 
access codes, for which interesting bounds on the Hilbert space dimension already exist. We discuss 
several applications of our result to hidden variable, or ontological models, to Bell inequalities and 
to properties of the smooth min-entropy. 



Loosely speaking, the dimension of the Hilbert space 
describing a quantum mechanical system limits the com- 
plexity or usefulness of the correlations that can be gen- 
erated by experiments on the system. For example, it 
has been suggested that the primary resource for quan- 
tum computation is Hilbert space dimension [l[ . In prac- 
tice though, when an experimentalist is faced with a real 
physical system the dimension of the Hilbert space is of- 
ten infinitely large in principle. The dimension of the 
Hilbert space that we use to describe the system of in- 
terest usually depends on the approximation used to de- 
scribe the physics of the system and may well depend on 
how well the experiment has in fact been set up. For this 
reason it would be of interest to be able to use the corre- 
lations observed in experiment to find strict lower bounds 
on the dimension of Hilbert space. Thus one could con- 
clude based on experimental data that the Hilbert space 
dimension of some system of interest was necessarily large 
and that the system could not be effectively approxi- 
mated by a smaller one. In this paper we show that it is 
certainly possible to derive very general lower bounds on 
Hilbert space dimension given experimental data. 

The properties of quantum correlations have been best 
studied in the setting of the Bell experiment. Imag- 
ine two parties, Alice and Bob, who are given access 
to shared quantum states \^}ab, but cannot commu- 
nicate. Each of them now performs a randomly chosen 
measurement on I^ab and records their measurement 
outcome. In order to obtain an accurate estimate for 
the correlation between their choice of measurement set- 
tings and measurement outcomes, Alice and Bob now 
perform this experiment many times, using an identically 
prepared state |vE') j 4b in each round. Quantum mechan- 
ics imposes strict limits on the strength of such non-local 
correlations, and it has been shown that we can com- 
pute bounds on these correlations for any such experi- 
ment 

HHH- (These bounds generalize the well known 
Tsirelson inequalities [H, [|| that apply to conventional 



Bell experiments that test the Clauser-Horne-Shimony- 
Holt inequality.) In particular, if we let p(a, b\s, t) be the 
probability that Alice and Bob obtain measurement out- 
comes a € A and b € B when performing measurements 
indexed by s £ S and t € T, we can test using the meth- 
ods of @, H, IU whether there exists a shared state 
and measurement operators M° and M t b for Alice and 
Bob such that 

p(a,b\s,t) = (^\M^M^), 

for all a,b,s, and t. But how large does the dimension of 
the Hilbert space have to be such that we can find such 
a state and measurements? 

Unfortunately, the methods of @, H, 0] do not give 
us any bound on the dimension in general. It is known 
that in the special case of two-party correlations, where 
Alice and Bob perform measurements using observables 
with eigenvalues ±1 (also known as XOR-games with 
A = B = {0, 1}), the dimension of the entangled state 
does not need to be larger than d = 2", where n = 
min(|5|, |T|) [1, @. Results are also known for a certain 
sets of 2-outcome inequalities [71 |8j . Very little is known 
otherwise. Even though one can construct an inequality 
with an infinite number of settings that requires an in- 
finitely large entangled state [9|, it is unknown whether 
there exist general experiments with a finite number of 
measurement settings for which an infinitely large entan- 
gled state is required to obtain the maximum possible 
quantum violation exactly. 

In the context of bipartite Bell experiments, the ques- 
tion of determining the Hilbert space dimension from ex- 
perimental data has been addressed in the recent work 
of Brunner et al. jlCj ]. Their aim was to reproduce 
the statistics of an experiment performed by two sep- 
arated observers on a single preparation of a bipartite 
quantum system. They introduce the concept of dimen- 
sion witness, which is a certain kind of generalization of 
Bell inequalities that make it possible to distinguish the 
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strength of correlations that can be obtained in different 
dimensions. This very nice approach makes it possible 
to find interesting lower bounds on the dimension of the 
system in use and has recently been extended by Brief 
et al. ll] for XOR games. Our work finds rather dif- 
ferent bounds on Hilbert space dimension that are ob- 
tained by a very different method. The bounds apply to 
quantum mechanical systems with any number of parties 
(even one), and apply also to the case where the experi- 
mental data refers to an arbitrary number of preparations 
of the system. Our bound is of particular significance if 
the number of measurement outcomes for each party is 
small. 

The general problem we consider is this: Suppose we 
are given a set of preparations S of a given quantum sys- 
tem and a set of measurements M., each of which has 
outcomes a £ A [13] • We are given, perhaps as a re- 
sult of experiments, probabilities p{a\j,r) of obtaining 
outcome a when performing the measurement Mj £ M 
having prepared the system in state r £ S. However we 
do not know either an explicit density matrix p for the 
preparation r or a measurement operator M a such that 

p(a\m,r) = Tr(pM a ) = p{a\M,p) 

where we use M a to denote the measurement operator 
corresponding to outcome a of measurement j, and will 
simply write p(a\j, p) := p(a\j, r) from now on. How large 
does the dimension of the Hilbert space supporting the 
states p have to be? 

This question was recently raised in [l2|, GH which 
determined the number of (hidden) variables in an on- 
tological model necessary to reproduce the probabilities 
p(a\j, r). In particular, it was shown that if each mea- 
surement has only two outcomes, then for a particular 
ontological model the number of hidden variables must 
be greater than N = min(|<S|, 2'^'). Here, we prove a 
simple lower bound that shows that in the quantum set- 
ting the dimension of our space scales as 2 clogJV , where c 
is a constant depending on the probabilities above. Thus, 
if the number of states |<S| and the number of measure- 
ments \M\ is large, the dimension of the quantum state 
that we need cannot be significantly smaller. 

In the following, we first prove a simple lower bound 
for this general problem. We then examine how we can 
use this to lower bound the dimension of the entangled 
state in a Bell experiment, and provide a simple example. 
In the appendix, we show that this example disproves 
that the smooth min-entropy is additive and that we can 
perform exact min-entropy splitting as for independent 
states, which is of interest in the noisy-quantum-storage 
model QSGJ, Git 

Throughout this note, we use h(p) :— —p\ogp — (1 — 
p) log(l — p) to denote the binary entropy, where all log- 
arithms are taken to base 2. We furthermore use S(H) 
to denote the set of all quantum states on the Hilbert 
space Ti., and write H(p) :— — Tr(plogp) for the von 
Neumann entropy of a state p £ S(Ti). Note that if p is 
classical, this reduces to the Shannon entropy, and that 



log(dim(«)) > H(p) > Theorem 11.8], since we 
may equivalently write H (p) = — . Xj log Xj where Xj 
is the j-th eigenvalue of p. We will also need the concept 
of a cq-state p x ® £ S(H. X ® TiP), a state that is part 
classical, part quantum, of the form 

where Px is a probability distribution over X and 
{x\x') = for x f x'. Let p x = Tr Q (p x ®) and 
pQ = Tix{p x ®) be the reduced states on systems 7i x 
and TiP respectively. The conditional von Neumann 
entropy is defined as H(X\Q) := H(p XQ ) - H{p Q ). 
We will also use the shorthands H{X) :— H(p x ) and 
H{XQ) := H{p x Q) and [I] := {1, . . . , £}. 

I. LOWER BOUND 

We first state the intuition behind our simple lower 
bound, based on quantum random access codes. A quan- 
tum (m, g,p)-random access code is an encoding of an 
m-bit string x into a g-qubit state p x such that for any 
i £ [m] we can retrieve the bit Xi from p x with proba- 
bility p. Note that we are only interested in retrieving a 
single bit of the original string x from p x . In general, it 
is unlikely that we will be able to retrieve more than a 
single bit. For such encodings it is not hard to prove a 
lower bound on the number of qubits q [l8| if the distri- 
bution over the strings is uniform and the probability of 
decoding each bit is the same. 

Now note that our problem has a very similar flavor. 
Suppose we were given states p\ , . . . , pi and measure- 
ments Mi , . . . , M m that give us the desired probabilities 
p(a\Mj, p x ). For simplicity, assume for now that I = 2 rn 
and a £ {0, 1}. Then the states px, . . . , pi form a gen- 
eralized quantum random access code, where each state 
represents an encoding of an m-bit string x and we think 
of Mj as the measurement that we can apply to extract 
bit Xj with probability p(a \ Mj, p x ). Once we realize this 
viewpoint it is indeed very intuitive that we should be 
able to apply techniques similar to the ones used for 
quantum random access codes also in the present setting. 

A. Tools 

We first state a general lemma from which our bound 
later follows by constructing an appropriate mapping 
that associates a string x with a state p x . Our proof is a 
straightforward extension of the techniques employed for 
the random access code lower bound [l|| m, H0| to more 
generalized distributions and alphabets: 

Lemma 1.1. Let X = A xm denote the set of strings 
of length m, let Px be a probability distribution over 
X and let X = Xi , . . . , X m denote a random variable 
chosen from X according to the distribution Px- Let 
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H Q be a Hilbert space supporting an an ensemble of 
states {P x (x),p x | p x E S(H Q ),x £ X} and POVMs 
Ej = (Ej 3 ), j £ [to] with outcomes Zj £ A. Let Zj 
be the random variable corresponding to the decoding of 
Xj by performing the measurement Ej on TiP where we 
use PzjiXjizjlxj) '■= Ti'{Ej J p x ) to denote the conditional 
probability distribution of a random variable Zj over A. 
Then 

dim(H Q )>2 H W-^ H ( x ^\ 

Proof. Consider a cq-state p XQ £ S{H X <g> H Q ) of the 
form 

p XQ = Y J Px{x)\x){x\®p^, 

X 

where := p x . We have 
log(dim(W Q )) > H(Q) > H(Q) - ^ P x (x)H(p®) 

X 

= H(X) + H(Q) - H(XQ) 
= H(X)-H(X\Q) 

m 

> H{X)-Y,H{X j \Q), 

where the first inequality follows from H(Q) < 
log(dim(H <3 )) [13, Theorem 11.8.2], the second from the 
fact that for all p® we have H(p$) > [13, Theo- 
rem 11.8.1], the third equality from H(XQ) = H(X) + 
J2 x p x( x ) H (Px) 0. Theorem 11.8.5], the fourth from 
the definition of the conditional von Neumann entropy, 
and the last inequality from its strong subadditivity 
H(X 1 . ..X m \Q) = H(X\Q) < r H{X,\Q) [17, The- 
orem 11.16], where Xj is the random variable correspond- 
ing to the j-th entry of X. 

Finally, note that we can express the effects of a mea- 
surement M on TiP by performing a unitary operation 
U on p x Q (g) |*b)(zb| € S{H X ®H Q ®H z i) with |z )(z | 
an initial pure state of Ti Zj , followed by tracin g ou t the 
ancilla Ti Zj holding the measurement outcome [17fl . We 
then have H(Xj\Q) — H(Xj\QZj) since U is unitary, 
and H(Xj\QZj) < H(Xj\Zj) since conditioning reduces 
entropy [13, Theorem 11.15.1], from which the claim fol- 
lows. □ 

This means that if we want to encode a string of n 
dits [28[ into a number of qubits and attempt to recover 
the j-th dit with the j-th measurement, then we need 
at least H(X) - Tjj H {Xj\Zj) qubits. H(X J \Zj) quan- 
tifies the uncertainty about the j-th bit given the out- 
come of the j-th measurement. For instance, if the n dits 
are drawn uniformly and independently (H(X) = nlogd 
where d = \A\) and we wish to recover them perfectly, 
{H(Xj | Zj ■ ) = for all j) then we need n log d qubits to do 
so. In the other extreme, where Zj holds no information 
about Xj i.e. our recovery probability is no better than 
guessing, we have H{Xj\Zj) = H{Xj) = \ogd for all j 
meaning that we need no qubits at all for the encoding. 



Corollary 1.2. For the definitions as given in 
Lemma it furthermore holds that 

&\m{H Q ) > 2 H{x) -'^j {h( - p > )+( - 1 - p 3 )log{ \ A \- 1) \ 

where pj = J2xex Px{x) Tr(i5j J p x ) is the average recov- 
ery probability of the j-th entry of X when measuring Ej 
onU Q . 

Proof. The statement follows immediately from 
Lemma 11.11 and Fano's inequality giving H(Xj\Zj) < 
h(Pj) + (1 — Pj)l°g(l-4| — 1), where pj is the average 
probability of correctly decoding the j-th bit of X given 
access to Zj. □ 

Note that the bound further simplifies to 

dim(ft) > 2 m ^' l ( p » 

in the case where Xj is binary and Px is the uniform 
distribution for which H(Xj) = 1 for all j and the recov- 
ery probability for each bit is lower bounded by p > i. 
This last bound was first noted in the context of random 
access codes. Lemma 11.11 does in general give a better 
bound than Corollary 11.21 although it may be harder to 
apply since it requires more information about the distri- 
butions and will be less convenient for us when consider- 
ing non-local games where we may have limited informa- 
tion. Fano's inequality is tight for a distribution where 
the most likely outcome has probability pj and all others 
have probability pj / (|-4| — 1) and in this case Corollarv lI.2l 
gives exactly the same bound as Lemma iLTl 



B. Dimension bound 

We are now ready to use these tools to prove a lower 
bound for our problem. Intuitively, we let the states 
pi , . . . , pe. corresponding to the preparations r\ , . . . , ri 
represent encodings of TO-element strings x chosen ac- 
cording to a probability distribution Px from X := A xm . 
If £ < \A\ m , then this just means that some strings have 
probability of occurring. If £ > \A\ m , then there are more 
elements in our string than we wish to extract in which 
case our lower bound will not be any stronger than could 
be obtained by letting I = \A\ m . There is some free- 
dom in applying the above bound to our setting, since 
we are in general free to associate strings with states in 
any way we like, pick any of our available measurements 
Mj to decode Xj and finally we may also choose any prior 
distribution Px{x), since our bound should hold for any 
such prior. First, we associate strings x £ X with states 
p £ S by constructing a map gr,R as follows: Recall 
that without loss of generality, we may order the states 
in lexicographic order pi, . . . ,p£. Let T C [£] such that 
|T| = min(£, let R C X such that \R\ = |T|, and 
consider the set of one-to-one maps 

Qt.r ■= {gr,R ■ R -> T\ix ^ x' £ X,g T . R {x) ^ g T M( x ')}- 
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That is, any map associates a unique state p gT R ( x ) with 
each string x. Second, we now construct maps e : [I] — > 
[£] and c : A x [£] — > A, that specify which measurement 
Ej = M e ^ we will use to extract a particular entry Xj of 
x = xi, ... , x m from p gT s ( x ), for a potential relabeling 

of the outcomes as M^l = given by the map c. 
Let T> = {(e, c)} denote the set of all such collections of 
maps. Finally, we may choose Px to be any distribution 
over X, where we will assign probability Px{x) — to 
any x ^ R. Note that this means Px is effectively a 
distribution over R. If we take Px — Px x x . . . x Py m to 
be a product distribution over all strings, then e(j) = j 
is simply the identity, i.e., we will use measurement Mj 
to decode the j-th element of the string. 

We first of all show that Lemma 11.11 gives us a lower 
bound on the dimension of the quantum system for any 
distribution Px, T C [£], R C X and mapping gr,R, c, 
and e. This will be important in Section HH where such 
mappings are fixed when considering a particular non- 
local game. We state both consequences of Lemma 11.11 
and Corollary II. 21 explicitelv: 

Corollary 1.3. Let S = {p u . . . ,p e | pj G S(H Q )} be a 
set of states and let M = {Mi, . . . , M m | Mj = (Mj 1 ) G 
B{H Q ),aE A} be a set of POVMs satisfying p(a\j,r) = 
Tr(Mjp r ) for some given set of probabilities {p(a\j,r) \ 
a G A,j G [m],r G [£}}. Then for any T C [£], R C X 
with \R\ = \T\ — voin{l,\X\), and gx,R G Gt,r, ( e ,c) G 
I?, and any distribution Px over R C. X = A xm giving 
ensemble {Px(x), p gT R ( x )} we must have 



dim(H Q ) > 2 H ( x )-S J (ft(p J )+(i-p 3 )iog(l^l-i)) j 



where P z ,\\ 
more, 



P 9 t,r(x))- Further- 



where p 3 = J2 xeX p x( x ) TT ( M e(jj' 3) PgT,R{x)) is the av- 
erage recovery probability of the j-th entry of X when 
measuring M e y) on 

We are now ready to state our main result as an im- 
mediate consequence of Corollary 11.31 

Theorem 1.4. Let S = {p t , ...,pi\ Pj G S(H Q )} be a 
set of states and let M = {Mi, . . . , M m | Mj = (Mj 1 ) G 
B{H Q ),a G .4} &e a set of POVMs satisfying p(a\j,r) = 
Tr(Mjp r ) for some given set of probabilities {p(a\j, r) \ 
a e A,j e [m],r G [£}}. Then 

dim(H Q ) > 2 C \ 



with 



C:= ™ m ^ x 1P H(X)-J2H(X j \Z j ) 



where the maximization is taken over all subsets T C [£], 
R Q X with \R\ = \T\ = min(^, \X\), and probability 
distributions Px over R, and mappings gr,R G Gt,r, 
(e,c) G V, with P z . lXj (zj\ Xj ) := Tv(M c e ^' 3) p gT R{x) ). 



Note that if we fix T, R, gr,R and (e, c) and further- 
more restrict the maximization to product distributions 
Px = Pxi x ... x P Xm we have from Lemma lLTl combined 
with Corollary 11.31 that 

dim(7i; Q ) > 2^ Cj , 

where Cj = maxp x L{Xj;Zj) is the Shannon channel 
capacity, and I(Xj; Zj) = H(Xj) — H(Xj\Zj) is the mu- 
tual information. Unfortunately, we do not know how 
hard it is to evaluate the quantity C in general when 
maximizing over all parameters. However, since for fixed 
T, R, gx.R, ( e , c ) it is equivalent to computing the Shan- 
non channel capacity it may not be an easy task for ar- 
bitrary distributions p(a\j, r). 

Let's look at a very simple example taken from [l2l.[T3l]. 
that illustrates our bound. The entries of the following 
table correspond to the probabilities p(a\Mj, p x ), for the 
two possible states labeled using strings '00' and '11'. 



M 


a 


Poo 


Pu 


M 





1 





1 





1 


Mi 





1 


1/2 




1 





1/2 



Note that in this example X — {0, l} 2 consists of the 
possible strings of two bits, but only '00' and '11' occur 
with non-zero probability. For simplicity, suppose we are 
given these states with probability 1/2 each, and hence 
we have H(X) = 1. Note that we can distinguish the two 
states perfectly using the first measurement, and hence 
both encoded bits can be recovered perfectly px = pi = 1. 
By reference to Corollary 1.2 we see that at least a two- 
dimensional system is required to recover these statis- 
tics. If we can only perform projective measurements, 
then (l2l . [l3| says that we need more than one qubit. 
Note however that this is not the case for generalized 
measurements. To perform the second measurement M 2 
we can perform the first measurement Mi = {M"}, and 
output for outcome but for outcome 1 we flip a coin 
that gives us and 1 with probability 1/2 each. This cor- 
responds to letting M% = Mf + M^/2 and M\ = Ml/2. 
Hence, our bound is tight for this trivial example. Be- 
low, we provide a second example that is inspired by the 
CHSH inequality. 

Our analysis shows that it is indeed possible to ob- 
tain bounds on the dimension in the quantum setting, 
partially answering an open question from (l2l . [l3| which 
asked to find such bounds for projective measurements. 
In particular, note that if we choose a uniform prior 
over N = mind^l^^) possible states, and consider 
only two outcome measurements we have by Corollary 
1.2 that the dimension of the system must obey logd > 

E?ii JV ( 1 - h <Pi)) ^ lo & Nc with c = n^iC 1 - k(Pi))- 
This means that in the case where Pj is not arbitrar- 
ily close to 1/2, and N itself is very large, the dimen- 
sion required is not significantly different from the one 
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required by the ontological model [HI, [l3j]. It is worth 
considering the dependence on pj which seems to be 
absent from this particular ontological model. If we 
merely want to represent the data classically in a way 
such that we can extract an arbitrary bit Xj alone with 
probability pj — p and the prior distribution over the 
strings is uniform, it is known that there do exist classi- 
cal random access codes for which the dimension obeys 
logo? = log AT(l-/i(p))+0(loglogJV) Intuitively, the 
description of the ontological model includes much more 
information and hence has a larger size. When exam- 
ining information processing within such an ontological 
model, it may however be worth considering whether it 
has a better representation for a particular task at hand. 



II. NON-LOCAL GAMES 

We now show how our approach also leads to a lower 
bound on the dimension of the entangled state that two 
or more parties need to share in any Bell experiment, 
where we consider a bound for the CHSH inequality as a 
small example. In this case we can immediately compute 
the lower bound since all parameters Px, T, R, gr,R, 
and (e,c) are fixed. For the present purposes, it is con- 
venient to view Bell experiments as a game between two, 
or more, distant players, who cooperate against a spe- 
cial party. We call this special party the verifier. In a 
two player game with players Alice and Bob, the verifier 
picks two questions s £ S and t S T and sends them 
to Alice and Bob respectively. Alice and Bob then re- 
turn answers a € A and b G B to the verifier, who then 
decides according to a fixed set of public rules whether 
Alice and Bob win by giving answers a and b to questions 
s and t. To win the game, Alice and Bob may agree on 
any strategy beforehand, but can no longer communicate 
once the game starts. Classically, such a strategy con- 
sists of shared randomness. In the quantum setting, they 
may choose any entangled state as part of their strategy 
and agree on any measurements to be performed on this 
state. Without loss of generality we can thus think of 
the questions as measurement settings and the answers 
as measurement outcomes. 

More formally, the game is characterized by finite sets 
S, T, A, B, a distribution n : S x T — > [0, 1] according to 
which the verifier chooses his questions, and a predicate 
V : Ax BxS xT -> {0,1}, where V(a, b\s, t) = 1 if and 
only if a and b are winning answers given questions s and 
t. Let tta and txb be the marginal probability distribu- 
tions over S and T respectively. For simplicity, we also 
assume that we are dealing with a unique game, where V 
is defined in such a way that for each b, s, t there exists 
exactly one winning answer a for Alice. Our argument 
for the general case is analogous, and can be obtained by 
combining the correct answers into one, which effectively 
corresponds to performing a measurement with less out- 
comes. However, our proof just becomes much harder 
to read. For simplicity in our explanations, we will also 



assume that the possible answers are the same for each 
possible measurement setting. 

Let Pr[a|s] and Pr[6|f] be the probabilities that Alice 
and Bob return answers a and b given questions s and t 
respectively. Note that the no-signaling condition must 
hold and hence we may without loss of generality assume 
these probabilities to be independent of the other parties 
measurement setting. We now show how to use our ap- 
proach from above to lower bound the dimension of the 
entangled state that Alice and Bob need to implement 
such a strategy. We are not concerned with the question 
whether there actually exists a strategy for Alice and Bob 
to obtain said distribution. This can be verified using the 
techniques of @, H, 0] . 

The simple trick is to realize that when Bob performs 
a measurement on his part of the state, he prepares a 
certain state on Alice's end. Let xl denote the state 
that is prepared for Alice if Bob has measurement setting 
t € T and obtains outcome b € B. The probability that 
Alice holds the state xl is given by 

Px(t,b) :=Pt[b\t]n B (t), 

where we combine t, b to index a string x € -4' 5 ' as fol- 
lows: Note that since we are dealing with unique games, 
we can define a function / : B x S x T — > A such 
that f(b,s,t) = a for V(a,b\s,t) = 1. We can label Al- 
ice's measurements with numbers from one up to |<S| and 
hence without loss of generality we will take S = [\S\] to 
represent the set of possible measurements for Alice. We 
define the string x € A^ s ^ as 

x:=f(b,l,t),...J(b,\S\,t). (1) 

and let 

Px ■= xl 

Since x is a function of b and t, we have 

P x {x) :=P x (t,b). 

If Alice chooses measurement setting s she will try and 
give the correct answer a. Note that effectively she tries 
to retrieve the entry x s — f(b,s,t) from p x , completing 
the analogy to quantum random access codes. 

To apply Lemma lL2l let p(a\M s ,xl) be the probabil- 
ity that Alice outputs a for measurement setting s and 
prepared state x\- 

Corollary II. 1. In any non-local game where Alice ob- 
tains the correct outcome a € A s for measurement setting 
s € S with probability p s , the dimension of her Hilbert 
space 7i, A obeys 

dim(H A ) > 2 H(xs >-^s( h (p s )+(i-p s )iog(\A s \~i))_ 

where X is the random variable corresponding to the 
choice of string as defined in Eq. (Qp. 
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Evidently, an analogous statement can be made for 
Bob. If we are considering more than two players, it is 
straightforward to extend our argument to bound the 
Hilbert space dimension of each individual player by 
grouping the remaining players together as one. 

Let's look at a small example which illustrates the 
proof. Consider the CHSH game. Here, A = B = S = 
T = {0, 1} and 7r is the uniform distribution. Alice's goal 
is to obtain an outcome a such that s ■ t = a + b mod 2. 
Letting x = g(b, t) — f(b, 0, t),f(b, 1, t) we obtain an en- 
coding of a two bit string x £ {0, l} 2 as <?(0,0) = 0,0, 
.9(1, 0) = 1, 1, g{0, 1) = 1, and g(l, 1) = 0, 1. How many 
qubits does Alice need to use if she always wants to give 
the correct answer with probability 7 = 1/2 + l/(2\/2)? 
With analogy to the table of our previous example, we 
have probabilities p(a\M s , p x ) given by 



M 


a 


Poo 


P01 


P10 


P11 


M 



1 


7 

(1-7) 


7 

(1-7) 


(1-7) 
7 


(1-7) 
7 


Mi 



1 


7 

(1-7) 


(1-7) 
7 


7 

(1-7) 


(1-7) 
7 



We have for all t,b & {0, 1} Pr[b\t] = 1/2 and hence 
Px(x) = Px(t,b) = 1/4. Since everything is uni- 
form we immediately obtain from Corollary III. II that 
log(dim(H A )) > (1 - H(p))2 m 0.8. Hence Alice needs 
at least one qubit to no great surprise. We do not need 
to know a specific strategy, however, for the well-known 
CHSH state and measurements we would have an en- 
coding of poo = |0)(0|, poi = |-)H, Pro = l+X+l, and 
P11 = |1) (1| which actually coincides with the best known 
quantum random access code for a 2 bit string. 

Bounds for other games for which we are given a dis- 
tribution over the measurement outcomes can be shown 
in an analogous way. In general, if we are given the full 
distribution over all settings and outcomes we can apply 
the first part of Corollary 11.31 to obtain a slightly better 
bound, depending on the distribution. 

III. MIN-ENTROPY 

Our task of lower bounding the dimension of the 
Hilbert space can be used to give a partial answer to 
an open problem in the analysis of cr ypt ographic proto- 
cols in the bounded-quantum-storage [2ll . |22j , and noisy- 
quantum-storage model [lH [l5l ITq ]. In particular, the 
example discussed in the previous section can be modi- 
fied to give a simple counterexample that shows that an 



additivity property of the smooth min-entropy that has 
been shown to hold for independent quantum states fiH ] 
is not true in general. Note that a modified version may 
still hold with additional loss in the parameters. The 
same counterexample can also be used to show that exact 
min-entropy splitting with respect to quantum knowledge 
as it holds for independent states [lj| is not possible in 
general without imposing further assumptions. We defer 
the details of this construction to the appendix. 



IV. CONCLUSION 

We have given a simple lower bound that places a fun- 
damental limit on how large the dimension of the state 
has to be to implement certain measurement strategies. 
Our result shows that in the limit of a large number of 
measurement settings and states, the dimension of this 
state cannot generally be significantly smaller than the 
amount of classical information (e.g. in the form of (hid- 
den) variables in an ontological model [12, EH ) necessary 
to produce the desired statistics. 

Our approach also gives a weak bound on the dimen- 
sion of the entangled state needed to implement non-local 
strategies for any multi-player non-local game. Note, 
however, that our bound will be quite weak if the proba- 
bility of outputting the correct outcome is close to 1/2, or 
the number of measurement outcomes is large. Further- 
more, note that our bound also works for the case where 
the choice of Alice's measurement settings is uniform 
which may not be the case for a particular game, leaving 
the possibility of better bounds. Yet, our approach is 
a first direction to find bounds for general games. It is 
an interesting question whether the the present idea of 
viewing the game as an encoding procedure leads to new 
upper bounds as well. 
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APPENDIX A: MIN-ENTROPY 

In this appendix, we describe the counterexample men- 
tioned in the text showing that the additivity property 
that was proved recently for the smooth min-entropy of 
independent quantum states does not hold in general. 
The same example can also be used to show that min- 
entropy splitting with respect to quantum knowledge as 
it was shown for such states does not hold in general, 



without imposing additional constraints. 

1. Definitions 

To state the additivity lemma, we will need the fol- 
lowing quantities introduced by Renner (23j . reproduced 
here for convenience: Let pab & S(Tt A <8> Tt B ) and let 
o B e S(H B ). Then the min-entropy of pab relative to 
ob is given by 

~Roc(pab\ob) ■= - log A, 

where A is the smallest real number such that A1a®ob > 
Pab- We need a related quantity, where in addition we 
optimize over states ob defined as 

Hoo(pab\B) := sup B-oo(pab\o-b) ■ 
o B es(H B ) 

For a cq-state Pxe, we also use the shorthand 

Hoo^l^) := sup HooIpxeIoe) 
<j E es(H B ) 

for the conditional min-entropy of X given E. It is dif- 
ficult to get an intuitive understanding from this formal 
definition of conditional min-entropy, but one can show 
using semi-definite programming duality [24j that 

R 00 (X\E) = -log P g (X\E), (Al) 

where P g (X\E) is defined as the maximum success proba- 
bility of guessing X by measuring the £?-register of pxE- 
Formally, for any (not necessarily normalized) cq-state 
Pxe, the guessing probability is defined as 

P g (X\E) := sup y^P x {x)Tr{M x p%), 

where the supremum ranges over all positive-operator 
valued measurements (POVMs) with measurement ele- 
ments {M x } xeX , i.e. M x > and V , M a = I E . If 
all side-information is classical, we recover the fact that 
the classical min-entropy is the negative logarithm of the 
maximum probability. 

We will also refer to smooth versions of these quanti- 
ties. Intuitively, we no longer consider the min-entropy 
of a fixed state pab , but allow us to move to some pab 
which is close to pab, but may have considerably larger 
min-entropy. These smooth quantities are often needed 
since they have some nicer properties than the conven- 
tional min-entropy. For e > 0, the e-smooth min-entropy 
of pab relative to ob is given by 

{pab\ob) ■= sup Hoo (pab | o b ) , 

Pab&>C e (pab) 

where JC%p AB ) := {pab e V(H A ® H B ) \ \\p AB - 
Pab\\i < Tt(pab)s and Tt(pab) < Tr(p A B)}- Finally, 
we need the related quantity of the e-smooth min-entropy 



8 



of Pab relative to B defined by Renner, where we now 
again maximize over all states <tb S S{7i B ): 

H L (pab\B) := supH^ (pabWb) ■ 



We also use the shorthand 

2. Additivity 

In [l6l . Lemma 2.2] it was shown that for two indepen- 
dent quantum states pxiEi and Px 2 e 2 we have 

Ko (^il^i) + Ko (x 2 \e 2 ) > nC(XiX 2 \E), 

where E — E\E 2 , where E\ and E 2 are independent. 
Hence, one might hope that something similar holds for 
a general ccq-state, in particular that we have 

(X X \E) + H^ (X 2 \E) > Rt(XiX 2 \E). (A2) 
However, we now show that there exists a cq-state 

Px x x 2 e= ^2 Px lX2 \xix 2 ){xix 2 \ ®pf, 
xix 2 e{o,i} 

that violates this statement for small e. 

From the chain-rule for the smooth min-entropy, and 
the data-processing inequality [23j, Theorem 3.2.12] we 
have 

H^(M 2 |£) > Hl o (X 1 X 2 E)-H (E) 
> R e oo (X 1 X 2 )-H (E) 

Using that Ho(E) = log rank pe we thus have 

logdim(H B ) > H^ {Xi.X 2 ) - H^ (X.X^E) . 

Now consider the CHSH example given above. Let p Xl x 2 
be the uniform distribution, and again let poo = |0)(0|, 
poi = | — > < — U Pw = l+X+l, and Pll = |1)(1|. The ran- 
dom variables X\ and X 2 here correspond to the choice 
of the first and second bit respectively. 

First, consider the case of e = 0. And suppose 
by contradiction that Eq. (|A2[) holds. Note that for 
our simple example we have for any D S {1,2} that 
Hoopffll-E) = - \ogt with t = 1/2+ l/(2\/2), since the 
min-entropy directly relates to the guessing probability 
as outlined in Eq. IjAll) . Hence, we would have 



However, we know that one qubit, i.e., logdim(7Y- E ) = 1, 
is sufficient for this encoding. For small e, we can 
make a similar argument by virtue of the fact that 
-log^A^-e] > H^ (XjlE) [H and H^ {X X X 2 ) > 

Hoo(^1^2)- 

Additivity of the smooth min-entropy was required as 
a tool to show a so-called min-entropy splitting lemma 
for independent quantum states [l6|. Intuitively, the 
technique of min-entropy splitting, first introduced by 
Wullschleger [26[ for classical min-entropy, states that if 
the min-entropy of two (or more) random variables XiX 2 
is high, then the min-entropy of either X\ or X 2 must be 
greater than half the joint min-entropy. Here, we are 
interested in the min-entropy of X\X 2 conditioned on 
quantum information. In particular, it was shown in [l6l . 
Lemma 2.7] that for s > and two independent states 
PXxEx and px 2 E 2 , satisfying 



H s oa (X 1 X 2 \E 1 E 2 ) > a, 



there exists a random variable D £ {1,2} such that 



logdim(W £ ) > H 0O (A 1 A 2 )-H 0O (A 1 A 2 |^) 
> 2 + 21ogi « 1.54. 



with E — E\E 2 . It was an open problem in [lq . 
whether this statement is also true for arbitrary ccq- 
states pxiX 2 E- Since additivity falls, it is no longer clear 
whether this would be true in general. By the same 
argument as above, one can also see that for X\ and 
X 2 being the random variables corresponding to the en- 
coding of the first or second bit respectively we cannot 
have that H^ {X X \E) > U 00 (X 1 X 2 \E)/2 or H 00 (X 2 \E) > 
H 00 (A 1 A 2 |S)/2. 

This small example shows that we must be very care- 
ful when trying to perform min-entropy splitting with 
respect to quantum information, and indeed one can also 
use the present example to disprove min-entropy split- 
ting for non-independent states. However, it does not 
rule out that such a statement is still true with a sig- 
nificant loss in the smoothing parameter e or by adding 
an additional fudge factor. Indeed, such statements in- 
volving additional factors are known if the number of 
random variables X\ , . . . , X n is small compared to the 
size of the set X over which the variables X\ , . . . , X n are 
distributed [25[ . Unfortunately though, they do not give 
nice bounds in our setting. 



